5 WHAT IS CLAIMED IS; 

1. A method for generating summaries of a video 
comprising the steps of: 

inputting summary sentences, visual information 
and a section-begin frame and a section-end frame for each 

10 story in a video; 

selecting a type of presentation; 

locating a set of images available for each 

story; 

auditing the summary sentences to generate an 
15 auditory narration of each story; 

matching said audited summary sentences with the 
set of images to generate a story summary video for each 
story in the video; and 

combining each of the generated story summaries 
20 to generate a summary of the video. 

2. The method of claim 1, wherein the visual information 
comprises at least one of a shotlist, a keyframelist and a 
combination thereof. 
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3. The method of claim 1, wherein the summary sentences 

are generated by: 

generating story boundaries and sentence data 

5 using a story extractor; 

selecting a length of a story summary; 
summarizing said sentence data to produce at 
least one summary sentence, wherein a number of the summary 
sentences produced corresponds to the length of the story 

10 summary; and 

ordering the at least one summary sentence based 

on its selection order. 

4. The method of claim 1, wherein the type of 
15 presentation comprises an image slide format. 

5. The method of claim 1, wherein the type of 
presentation comprises a poster format. 

20 6. The method of claim 1, wherein the section-begin frame 
and the sect ion- end frame determines a story boundary. 

7. The method of claim 1, wherein the step of locating 
the set of images further comprises the steps of: 
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collecting a list of images within a story 

boundary ; 

generating a mergelist for clustering images 
corresponding to each shot into visually similar groups; 
5 deleting images belonging to a largest visually 

similar group; and 

sampling a remaining list of images to produce 
the set of images. 

10 8. The method of claim 7, wherein the sampling is 

performed uniformly with a sampling interval determined by 
a number of images desired for a given length of story 
summary . 

15 9. The method of claim 7, wherein the step of sampling 
further comprises selecting a frame number of each proper 
noun. 

10. A method for generating summaries of a video, 
20 comprising the steps of: 

inputting story summary sentences, video 
information and speaker segments for each story in a video; 

locating video clips for each stofry from said 
video information; 
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capturing audio clips from the video clips, said 
audio clips corresponding to the summary sentences; 

combining said corresponding audio clips with the 
video clips to generate a story summary video for each 
story in the video; and 

combining each of the generated story summaries 
to generate a summary of the video. 

11. The method of claim 10, wherein the summary sentences 
are generated by: 

generating story boundaries and sentence data 
using a story extractor; 

selecting a length of a story summary; 

summarizing said sentence data to produce at 
least one summary sentence, wherein a number of the summary 
sentences produced corresponds to the length of the story 
summary; and 

ordering the at least one summary sentence based 
on its selection order. 

12. A program storage device readable by a machine, 
tangibly embodying a program of instructions executable by 
the machine to perform method steps for generating 
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summaries of a video, the method steps comprising the steps 
of: 

providing summary sentences, visual information 
and a section-begin frame and a section-end frame for each 
5 story in a video; 

selecting a type of presentation; 

locating a set of images available for each 

story; 

auditing the summary sentences to generate an 
10 auditory narration of each story; and 

matching said audited summary sentences with the 
set of images to generate a story summary video for each 
story in the video, wherein a summary of the video is 
generated by combining each of the generated story 
15 summaries . 

13. The program storage device of claim 12, wherein the 
visual information comprises at least one of a shotlist, a 
keyframelist and a combination thereof. 

20 

14. The program storage device of claim 12, wherein the 
instructions for generating summary sentences comprise 
instructions for performing the steps of: 
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generating story boundaries and sentence data 
using a story extractor; 

selecting a length of a story summary; 

summarizing said sentence data to produce at 
least one summary sentence, wherein a number of the summary 
sentences produced corresponds to the length of the story 
summary; and 

ordering the at least one summary sentence based 
on its selection order. 

15. The program storage device of claim 12, wherein the 
type of presentation comprises an image slide format. 

16. The program storage device of claim 12, wherein the 
type of presentation comprises a poster format. 

17. The program storage device of claim 12, wherein the 
section-begin frame and the section-end frame determines a 
story boundary. 

18. The program storage device of claim 12, wherein the 
step of locating the set of images further comprises the 
steps of : 
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collecting a list of images within a story 

boundary; 

generating a mergelist for clustering images 
corresponding to each shot into visually similar groups; 
5 deleting images belonging to a largest visually 

similar group; and 

sampling a remaining list of images to produce 
the set of images . 

10 19. The program storage device of claim 18, wherein the 
sampling is performed uniformly with a sampling interval 
determined by a number of images desired for a given length 
of story summary, 

15 20. The program storage device of claim 18, wherein the 

step of sampling further comprises selecting a frame number 
of each proper noun. 
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